Table Image Understanding
نویسندگان
چکیده
We are developing a system to perform table image understanding. The recognition problem is to locate and characterize the cells of a table in a two-dimensional black and white document image. More meaning is implicit in the geometric location of the image regions (cells) of a table than in most other document types. Ideally, we wish to develop a strategy for extracting the underlying relational information from the image given its visual clues such as ruling lines, space separations, etc. Various strategies of achieving this physical to logical mapping of table information are presented.
منابع مشابه
Interactive Learning of a Multiple-Attribute Hash Table Classifier for Fast Object Recognition
متن کامل
Recovering ball motion from a single motion-blurred image
Motion blur often affects the ball image in photographs and video frames in many sports such as tennis, table tennis, squash and golf. In this work we operate on a single calibrated image depicting a moving ball over a known background, and show that motion blurred ball images, usually unwelcome in computer vision, bear more information than a sharp image. We provide techniques for extracting s...
متن کاملTable Structure Identification from Document Images: A Survey
Table structure identification has received significant research attention in the past few years. The OCR (Optical Character Recognition) has faced many potential errors in the document images, so it is strongly required to make logical objects such as table explicit. So to get the deep understanding about the contents of the document, proper understanding of the document is required by means o...
متن کاملLayout and Language: Challenges for Table Understanding on the Web
In this paper, we consider the table understanding task and present a catalogue of particular issues that arise when the tables are those found on the web. In addition, we consider what happens when processes commonly associated with web pages are applied to those bearing tables. 1 Table Understanding and the Web The ubiquity of tables, and their ability to describe relational information in a ...
متن کاملLinear-time connected-component labeling based on sequential local operations
This paper presents a fast algorithm for labeling connected components in binary images based on sequential local operations. A one-dimensional table, which memorizes label equivalences, is used for uniting equivalent labels successively during the operations in forward and backward raster directions. The proposed algorithm has a desirable characteristic: the execution time is directly proporti...
متن کامل